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ppnrT^ ^j i ^ TOR OXT mjfilMG TERPEMES 
The invention relates to a process for 
enzymatically oxidising terpenes and cycloalkenes . 

Terpenoid compounds are widespread in biological 
systems and constitute one of the largest class of 
natural products. They are major constituents of 
essential oils, some of which are of considerable value 
in the flavour and perfume industries. Many terpenoids 
are also biologically active. Some are anti-bacterial and 
anti-fungal agents and thus are of great interest to the 
pharmaceutical industry. Indeed, terpenoids are some of 
the highest added value chemicals. 

The terpenoids of commercial interest are not 
normally the terpenes themselves, but rather derivatives 
which commonly require stereoselective functionalisation 
at allylic as well as non-activated carbon-hydrogen bonds 
of the parent terpene. This type of chemical 
transformation is one of the most difficult reaction to 
carry out by conventional methods of chemical synthesis - 
the highly reactive chemical oxidising agents required 
are non-selective and typically they will preferentially 
attack more activated carbon-hydrogen bonds and reactive 
functional groups such as olefinic double bonds commonly 

present in terpenes. 

The present invention concerns the enzymatic 
oxidation of terpenes and cycloalkenes. This technique 
enables the synthesis of hydroxylated terpenes (and 
cycloalkenes), often in a single step, and provided that 
the match between substrate and enzyme is correct, the 
oxidation reaction can be highly chemoselective (attack 
at a particular functional group such as a non-activated 
C-H bond rather than some other reactive functional 
group) and stereoselective. The fine tuning and 
alterations of substrate specificity and selectivity of 
substrate oxidation are very difficult to achieve for 
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conventional reagents. 

The present invention provides a process for 
oxidising a substrate which is an acyclic or cyclic 
terpene or a cycloalkene/ or a substituted derivative 
thereof, which process comprises oxidising said compound 
with a mutant haem-containing enzyme / the mutant 
comprising the substitution of an amino acid in the 
active site by an amino acid with a less polar side- 
chain. 

Although the terpenes used in the present invention 
will generally have the formula (C5HB)n where n is 2 or 
more, especially 2, 3 or 4, it is to be understood that 
the term '"terpene" extends to compounds which are 
strictly referred to as terpenoid'' , involving the loss 
or shift of a fragment, generally a methyl group. Thus, 
for example, sesquiterpenes (where n is 3) which can be 
used in the present invention may contain only, say, 14, 
rather than 15, carbon atoms. Generally the terpene is 
one which can be built up from isoprene units. The 
terpene may be cyclic or acyclic. 

The monoterpenes (where n is 2) will generally have 
10 carbon atoms, typically with 1 to 3 double bonds, 
especially 1 or 2 ring double bonds, and typically with 0 
to 2 rings. It is possible for one of the rings to be 
formed as a bridge containing, typically 0 or 1 carbon 
atoms. In other words, it can be formed by a direct link 
between 2 carbon atoms of an existing ring or with an 
intermediate methylene group. If the terpene is acyclic 
it will generally contain at least 2 double bonds and 
generally 3. 

The sesquiterpenes will normally contain 14 or 15 
carbon atoms, typically with 0 to 2 double bonds and 
typically 1 to 3 rings, with the possibility of fused 
rings and/or bridged rings. 

The rings which may be present in the terpenes will 
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typically have from 3 to 9 carbon atoms, more especially 
5 or 6 carbon atoms. Thus, in particular, the terpenes 
will contain a cyclohexane, or cyclohexadiene ring. 

The terpenes will generally contain a total of 3 or 
4 exocyclic methyl or methylene groups, for example 2 
methyl groups and 1 methylene group or 3 methyl groups 
for a monoterpene, and 3 methyl groups and 1 methylene 
group or 4 methyl groups for a sesquiterpene. 

The monoterpene is typically a limonene, pinene, 
terpinene, sabinene, thujene, mercene, ocimeme, nerol or 
geraniol, for example as shown in Table 1. 

The sesquiterpene is generally formed by a head-to- 
tail arrangement of three isoprene units. The 
sesquiterpene is typically an aromadendrene, 
caryophyllene, longifolene, valencene, isobazzanene, 
silphinene, ishwarane, isopatchchoul-3-ene, or 
isosesquicarene, for example as shown in Table 2. 

The diterpene (where n is 4) is typically casbene, 
retinal, abietic acid or a gibberellin. 

The cycloalkene generally comprises up to 9 ring 
members, e.g. it is a 5, 6, 7, 8, 9 or more membered 
ring. The cycloalkene is typically a cyclohexene. 

Substituted derivatives of any of the terpenes or 
cycloalkenes mentioned above may also be used. Typically 
1, 2, 3 or more substituents are present. 2toy 
combination of the following substituents may be present. 
The substituent is typically a halogen atom or an alkyl 
or alkenyl group, which generally has 1 to 6 carbons, the 
substituent optionally being substituted with one or more 
halogens. It is generally not phenylcyclohexene. Indeed 
the presence of aromatic, such as phenyl, rings is 
generally avoided for all the substrates used in the 
invention. 

The substituent typically has the formula CnHkX„, 
wherein X is the halogen, n is 1, 2, 3 or more, m is 1, 
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2, 3, 4 or more and k is an integer which has an 
appropriate value so that the valencies of the 
substituent C^R^X^ are satisfied. For an alkyl 
substituent k+m = 2n+l. Typically k is 1, 2, 3, 4 or 
5 more, or may be 0, i.e. the substituent is a perhaloalkyl 
group. The halogen is typically fluorine, chlorine or 
bromine . 

The substituent may also comprise 1, 2 or more 
oxygen atoms and for example may be an alcohol, aldehyde, 

10 ketone or epoxide group. 

The oxidation causes the formation of a C-0 bond 
in the compound, generally as the hydroxide from the 
oxidation of a carbon-hydrogen bond, but an epoxide may 
be formed from the oxidation of a C=C bond. The 

15 oxidation may thus introduce a hydroxy, aldehyde, ketone 
or epoxide group. Alternatively the oxidation may cause 
the further oxidation of an oxygen containing group, such 
as converting a hydroxy group into an aldehyde or ketone 
group. 1, 2 or more carbon atoms may be attacked in the 

20 same siibstrate molecule. 

The oxidation typically gives rise to 1, 2 or more 
oxidation products. These different products may result 
from different carbon atoms being attacked and/or from 
different degrees of oxidation occurring at a given 

25 carbon atom. 

The oxidation may occur on either a ring carbon 
atom or a substituent carbon atom or both. At least the 
initial oxidation will involve attack of a C-H bond which 
may be activated or non-activated or attack at a carbon- 

30 carbon double bond (typically giving an epoxide) . 

Generally an activated C-H bond is where the carbon atom 
is in a benzylic or allyl position. Aromatic rings and 
olefinic double bonds activate C~H bonds to attack by 
stabilising the radical intermediate or any build-up of 

35 charge generated during the reaction pathway. The 
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carbon of the C-H bond may be a primary, secondary or 
tertiary carbon. 

The oxidation typically preserves stereoisomerism. 
Thus when the substrate consists of a single stereoisomer 
5 the product typically consists of a single corresponding 
stereoisomer, or can contain a preponderance of the 
corresponding stereoisomer. 

The enzyme used in the process is generally a P450 
enzyme, typically of eukaryotic or prokaryotic origin. 

10 The enzyme is generally of bacterial, fungal, yeast, 

plant or animal origin, and thus may be from a bacterium 
of the genus Pseudomonas. The enzyme is typically a 
monooxygenase . The non-mutant form of the enzyme may or 
may not be able to oxidize terpenes and/or cycloalkenes . 

15 The mutations discussed herein are generally 

introduced into the enzyme by using methods known in the 
art, such as site directed mutagenesis of the enzyme, PGR 
and gene shuffling methods or by the use of multiple 
mutagenic oligonucleotides in cycles of site-directed 

20 mutagenesis. Thus the mutations may be introduced in a 
directed or random manner. The mutagenesis method thus 
produces one or more polynucleotides encoding one or more 
different mutants. Typically a library of mutant 
oligonucleotides is produced which can be used to produce 

25 a library of mutant enzymes. 

An amino acid Un the active site' is one which 
lines or defines the site in which the substrate is bound 
during catalysis or one which lines or defines a site 
through which the substrate must pass before reaching the 

30 catalytic site. Therefore such an amino acid typically 
interacts with the substrate during entry to the 
catalytic site or during catalysis. Such an interaction 
typically occurs through an electrostatic interaction 
(between charged or polar groups) , hydrophobic 

35 interaction, hydrogen bonding or van der Waals forces. 
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The amino acids in the active site can be 
identified by routine methods to those skilled in the 
art. These methods include labelling studies in which the 
enzyme is allowed to bind a substrate which modifies 
5 ('labels') amino acids which contact the substrate. 

Alternatively the crystal structure of the enzyme with 
bound substrate can be obtained in order to deduce the 
amino acids in the active site. 

The enzyme may have 1, 2, 3, 4, 5 to 10, 10 to 20 

10 or more other mutations, such as substitutions, 

insertions or deletions. The other mutations may be in 
the active site or outside the active site. Typically 
the mutations are in the 'second sphere' residues which 
affect or contact the position or orientation of one or 

15 more of the amino acids in the active site. The 

insertion is typically at the N and/or C terminal and 
thus the enzyme may be part of a fusion protein. The 
deletion typically comprises the deletion of amino acids 
which are not involved in catalysis, such as those 

20 outside the active site (thus the enzyme is a mutated 

fragment of a naturally occurring enzyme) . The enzyme may 
thus comprise only those amino acids which are required 
for oxidation activity. 

The other mutation in the active site typically 

25 alters the position and/or conformation of the substrate 
when it is bound in the active site. The mutation may 
make the site on the substrate which is to be oxidized 
more accessible to the haem group. Thus the mutation may 
be a substitution to an amino acid which has a smaller or 

30 larger, or more or less polar, side chain. 

The other mutations typically increase the 
stability of the protein, or make it easier to purify the 
protein. They typically prevent the dimerisation of the 
protein, typically by removing cysteine residues from the 

35 protein (e.g. by substitution of cysteine at position 334 
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of P450c,„, or at an equivalent position in a homologue, 
preferably to alanine) . They typically allow the protein 
to be prepared in soluble form, for example by the 
introduction of deletions or a poly-histidine tag, or by 
5 mutation of the N-terminal membrane anchoring sequence. 
The mutations typically inhibit protein oligomerisation, 
such as oligomerisation arising from contacts between 
hydrophobic patches on protein surfaces. 

Thus the mutant enzyme is typically at least 70% 

10 homologous to a naturally occurring haem-containing 
enzyme on the basis of amino acid identity. 

Any of the homologous proteins (i.e. described as 
being homologous to another protein) mentioned herein are 
typically at least 70% homologous to the relevant protein 

15 or at least 80 or 90% and more preferably at least 95%, 
97% or 99% homologous thereto over at least 20, 
preferably at least 30, for instance at least 40, 60 or 
100 or more contiguous amino acids. The contiguous amino 
acids may include the active site. This homology may 

20 alternatively be measured not over contiguous amino acids 
but over only the amino acids in the active site. 

Homology can be measured using known methods. For 
example the UWGCG Package provides the BESTFIT program 
which can be used to calculate homology (for example used 

25 on its default settings) (Devereux et ai (1984) Nucleic 
Acids Research 12, p387-395) . The PILEUP and BLAST 
algorithms can be used to calculate homology or line up 
sequences (typically on their default settings), for 
example as described in Altschul S. F. (1993) J Mol Evol 

30 36:290-300; Altschul, S, F et aJ (1990) J Mol Biol 
215:403-10. 

Software for performing BLAST analyses is publicly 
available through the National Center for Biotechnology 
Information (htto: //www.ncbi.nlm.nih.aov/) . This 
35 algorithm involves first identifying high scoring 
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sequence pair (HSPs) by identifying short words of length 
W in the query sequence that either match or satisfy some 
positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is 
5 referred to as the neighbourhood word score threshold 
(Altschul et al, supra) . These initial neighbourhood 
word hits act as seeds for initiating searches to find 
HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the 

10 cumulative alignment score can be increased. Extensions 
for the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X 
from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or 

15 more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm 
parameters T and X determine the sensitivity and speed 
of the alignment. The BLAST program uses as defaults a 
word length (W) of 11, the BLOSUM62 scoring matrix (see 

20 Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 

89: 10915-10919) alignments (B) of 50, expectation (E) of 
10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm performs a statistical analysis 
of the similarity between two sequences; see e.g., Karlin 

25 and Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873- 
5787. One measure of similarity provided by the BLAST 
algorithm is the smallest sum probability (P(N)), which 
provides an indication of the probability by which a 
match between two nucleotide or amino acid sequences 

30 would occur by chance. For example, a sequence is 

considered similar to another sequence if the smallest 
sum probability in comparison of the first sequence to 
the second sequence is less than about 1, preferably less 
than about 0.1, more preferably less than about 0.01, and 

35 most preferably less than about 0.001. 
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Typically the homologous protein differs from the 
relevant protein by at least 1, 2, 5 or 10 mutations 
(substitutions, insertions or deletions) when compared to 
all of the protein or over any of the lengths of 
contiguous amino acids mentioned above. 

The enzyme used in the process is preferably a 
mutant of P^SO^^^ (such as mutant of the sequence shown in 
table 7) or a mutant of a naturally occurring homologue 
of P450c^„, typically of P450bm.3 from Bacillus megaterium 
(such as a mutant of the sequence shown in table 8), 
P450^„p from Pseudomonas sp, and P450eryF from 
Saccharopollyspora erythraea^ and also P450 105 Dl 
(CYP105) from Streptomyces griseus strains. Note that 
the amino acid numbering shown in table 8 for P4 50bm-3 
does not correspond to the numbering used in the 
description to denote mutations in this enzyme. The 
sequence shown in table 8 contains an additional amino 
acid at the N terminal. This is normally cleaved in 
vivo. Therefore each amino acid number shown in the 
table is always one more than the number used in the 
conventional numbering (as used in the description) . 

The naturally occurring homologue of P450cam (e.g. 
of P450bm-3) niay have substantially the same activity as 
P450cam or P450b„.3, The homologue may be a species 
homologue or an allelic variant of P45Gcam from 
Pseudomonas putida or of P450Bf^.3. The amino acids in the 
active site of the homologue may be the same as in the 
active site of P450c3„ or of P450b„.3. Typically the amino 
acid at the equivalent position to 96 in P450catt is a 
tyrosine in the homologue. 

The mutant of P450ca„ or of a homologue of P450ca„, is 
typically one in which amino acid 96, or the equivalent 
amino acid in a homologue, has been changed to an amino 
acid with a less polar side chain. In the case where the 
homologue is P450bm-3 the mutant typically has a 
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substitution (to a less polar amino acid) at 47 and/or 51 
and/or 42 and/or 75 and/or 354 and/or 264 and/or 263 
and/or 181 and typically does not have a mutation at the 
equivalent site to 96 of P450eam (preferred mutants of 
5 P450b„.3 have at least mutations at 47 and 51, or at the 
equivalent sites in homologues) . 

Thus typically the substitution is to an amino acid 
which is above the original amino acid in Table 3, such 
as the preferred mutations shown in Table 4 and Table 5. 

10 The 'equivalent' side chain in the homologue is one 

at the homologous position. This can be deduced by lining 
up the P450c3„ or P450bm-3 sequence and the sequence of the 
homologue based on the homology between the two 
sequences. The PILEUP and BLAST algorithms can be used to 

15 line up the sequences. The equivalent amino acid will 

generally be in a similar place in the active site of the 
homologue as any of the specific amino acids discussed 
herein, such as amino acid 96 in P450cam- 

The discussion below provides examples of the 

20 positions at which substitutions may be made in P450cai„ 
and P450b„-3. The same substitutions may be made at 
equivalent positions in the homologues. Standard 
nomenclature is used to denote the mutations. The letter 
of the amino acid present in the natural form is followed 

25 by the position, followed by the amino acid in the mutant 
(these positions can be correlated to the numbering shown 
in tables 7 and 8 with the proviso discussed above with 
regard to table 8 amino acid numbering) . To denote 
multiple mutations in the same protein each mutation is 

30 listed separated by hyphens. The mutations discussed 

below using this nomenclature specify the natural amino 
acid in P450^,„ or P450b„-3 but it is to be understood that 
the (same) mutation could be made to a homologue which 
has a different amino acid at the equivalent position. 

35 An additional mutation is typically an amino acid 
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substitution at amino acid 87, 98, 101, 185, 244, 247, 
248, 296, 395, 396 of P450,,, (or a combination of these, 
for example as shown in table 4) . 

The following combinations of substitutions are 
preferred for P450ca„: 

(i) Substitution at position 87 to amino acids of 
different side-chain volume, such as substitutions 

(typically of F) to A, L, I and W, combined with 
substitutions at position 96 to amino acids of different 
side-chain volume such as (typically Y to) A, L, F, and 
W. These combinations alter the space available in the 
upper part of the substrate pocket compared to the wild- 
type enzyme, for example, from Y96W-F87W (little space) 
to Y96A-F87A (more space), as well as the location of the 
space, for example from one side in Y96F-F87A to the 
other in Y96A-F87W. 

(ii) Substitution at position 96 to F combined with 
substitutions at positions 185 and 395. Both T185 and 
1395 are at the upper part of the substrate pocket, and 
substitution with A creates more space while substitution 
with F will reduce the space available and push the 
substrate close to the haem. 

(iii) Substitutions at position 96 to A, L, F, and W 
combined with substitutions at residues closer to the 
haem including at 101, 244, 247, 295, 296 and 396 to A, 
L, F, or W, These combinations will create or reduce 
space in the region of the different side-chains to offer 
different binding orientations to substrates of different 
sizes. For example, the combinations Y96W-L244A and 
Y96L-V247W will offer very different substrate pockets 
for the binding of R-limonene. 

(iv) Triple substitutions at combinations of positions 
87, 96, 244, 247, 295, 296, 395 and 396 with combinations 
of A, L, F, and W. The aim is to vary the size and shape 
of the hydrophobic substrate binding pocket. For 
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example, the Y96A-F87A-L244A combination creates more 
space compared to the Y96F-F87W-V396L combination, thus 
allowing larger terpenes to bind to the former while 
restricting the available binding orientations of smaller 
5 terpenes in the latter. The combinations Y96F-F87W-V247L 
and Y96F-F87W-V295I have comparable substrate pocket 
volumes, but the locations of the space available for 
substrate binding are very different. The combination 
Y96F-F87L-V247A has a slightly larger side-chain volume 

10 at the 96 position than the combination Y96L-F87L-V247A, 
but the L side-chain at the 96 position is much more 
flexible and the substrate binding orientations will be 
different for the two triple mutants, 
(v) The mutants with four or five substitutions were 

15 designed with similar principles of manipulating the 

substrate volume, the different flexibility of various 
side-chains, and the location of the space available in 
the substrate pocket for terpene binding so as to effect 
changes in selectivity of substrate oxidation. 

20 The invention also provides the mutant of P450caB» or 

a mutant of a homologue of P450ca,„ (such as P450bm-3) as 
discussed above, excluding mutants of P450ca„ which only 
have the mutations: 

F87A-Y96G-F193A, F87A-Y96G-F193A-C334A or T101M-T185F- 
25 V247M. 

The mutant enzyme may be in a substantially 
isolated form and/or a substantially purified form, in 
which case it will generally comprise (e.g. about or at 
least) 90%, such as (e.g. about or at least) 95%, 98% or 

30 99% of the protein in the preparation. 

The invention also provides a polynucleotide which 
comprises a sequence which encodes the mutant enzyme of 
the invention. The polynucleotide is typically DNA or 
RNA, and may be single or double stranded. The 

35 polynucleotide may be able to hybridise with a 
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polynucleotide encoding the naturally occurring form of 
any mutant discussed herein (each the polynucleotide 
shown in table 7 or 8) . it typically hybridises with the 
relevant polynucleotide at a level significantly above 
5 background. The signal level generated by the 

interaction is typically at least 10 fold, preferably at 
least 100 fold, as intense as 'background' hybridisation. 
The intensity of interaction may be measured, for 
example, by radiolabelling the probe, e.g. with ^^P. 

10 Selective hybridisation is typically achieved using 
conditions of medium to high stringency (for example 
0.03M sodium chloride and 0.03M sodium citrate at from 
about 50°C to about 60°C) . 

Polynucleotides of the invention can be 

15 incorporated into a recombinant replicable vector. The 
vector may be used to replicate the nucleic acid in a 
compatible host cell. Thus one method of making 
polynucleotides of the invention comprises introducing a 
polynucleotide of the invention into a replicable vector, 

20 introducing the vector into a compatible host cell, and 
growing the host cell under conditions which bring about 
replication of the vector. The vector may be recovered 
from the host cell. Suitable host cells are described 
below in connection with expression vectors, 

25 Preferably, a polynucleotide of the invention in a 

vector is operably linked to a control sequence which is 
capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an 
expression vector. 

30 The term "operably linked" refers to a 

juxtaposition wherein the components described are in a 
relationship permitting them to function in their 
intended manner. A control sequence "operably linked" to 
a coding sequence is ligated in such a way that 

35 expression of the coding sequence is achieved under 
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conditions compatible with the control sequences. 

Such vectors may be transformed into a suitable 
host cell to provide for expression of the mutant enzyme. 
The vectors may be, for example, plasmid, virus or 
5 phage vectors provided with an origin of replication, 
optionally a promoter for the expression of the said 
polynucleotide and optionally a regulator of the 
promoter. The vectors may contain one or more selectable 
marker genes, for example an ampicillin resistance gene 

10 in the case of a bacterial plasmid or a neomycin 

resistance gene for a mammalian vector. Vectors may be 
used in vitro, for example for the production of RNA or 
used to transfect or transform a host cell. 

Promoters and other expression regulation signals 

15 may be selected to be compatible with the host cell for 
which the expression vector is designed. For example, 
E.Coli promoters include iac, tac, trc, trp and T7 
promoters, and yeast promoters include S. cerevisiae GAL4 
and ADH promoters, S. pombe nmtl and adh promoters. 

20 Mammalian promoters include the metallothionein promoter 
which can be induced in response to heavy metals such as 
cadmium. The expression vectors are possible for use in 
insect or mammalian cells. For use in insect cells, 
strong baculovirus promoters such as the polyhedrin 

25 promoter are preferred. For expression in mammalian 

cells, strong viral promoters such as the SV40 large T - 
antigen promoter, a CMV promoter or an adenovirus 
promoter may also be used. All these promoters are 
readily available in the art. 

30 Expression vectors of the invention are typically 

introduced into host cells using conventional techniques 
including calcium phosphate precipitation, DEAE-dextran 
transf ection, or electroporation. 

The expression vector may contain a selectable 

35 marker and/or such a selectable marker may be co- 
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transfected with the expression vector and stable 
transfected cells may be selected. 

Suitable cells include cells in which the 
abovementioned vectors may be expressed. Such cells may 
5 be prokaryotic or eukaryotic. These include microbial 

cells typically bacteria such as E. coli^ preferably the 
strains DHs^, JM109, NM522 and BL21DE3 or Pseudomonas, 
typically putida, mammalian cells such as CHO cells, COS? 
cells or HeLa cells, insect cells or yeast such as 

10 Saccharomyces . Baculovirus or vaccinia expression 
systems may be used. 

Cell culture can take place under standard 
conditions. Generally the cells are cultured in the 
presence of assimible carbon and nitrogen sources. 

15 Commercially available culture media for cell culture are 
widely available and can be used in accordance with 
manufacturers instructions . 

Typically the process of the invention is carried 
out in vitro/ such as in a cell free system. The process 

20 may be carried out in vivo in a cell. 

Typically, in addition to the enzyme (a) and the 
siibstrate the process of the invention is carried out in 
the presence of an electron transfer reductase (b) , an 
electron transfer redoxin (c) , cofactor for the enzyme 

25 and an oxygen donor. In this system the flow of 

electrons is typically: cofactor (b) -> (c) (a) . 
However particular enzymes do not require the presence of 
an electron reductase and electron transfer redoxin, such 
as P450bm-3. Although the following discussion is 

30 particularly directed to enzymes which do require a 

reductase or redoxin, it is applicable to enzymes which 
do not require these (for example the various 
concentrations, conditions and rates are suitable for 
these enzymes) . 

35 For enzymes which do require a reductase and 
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redoxin (b) is generally an electron transfer reductase 
which is able to mediate the transfer of electrons from 
the cofactor to (c) , such as a naturally occurring 
reductase or a protein which has homology with a 
5 naturally occurring reductase, such as at least 70% 

homology; or a fragment of the reductase or homologue. 
Thus (b) may be derived from any of the organisms listed 
above from which the haem-containing enzyme may be 
derived, (b) is typically a flavin dependent reductase, 

10 such as putidaredoxin reductase. 

(c) is generally an electron transfer redoxin which 
is able to mediate the transfer of electrons from the 
cofactor to (a) via (b) . (c) is typically a naturally 
occurring electron transfer redoxin or a protein which 

15 has homology with a naturally occurring electron transfer 
redoxin, such as at least 70% homology; or a fragment of 
the redoxin or homologue. Thus (c) may be derived from 
any of the organisms listed above from which the haem- 
containing enzyme may be derived, (c) is typically a two- 

20 iron/ two sulphur redoxin, such as putidaredoxin. 

The cofactor is any compound capable of donating an 
electron to (b) , such as NADH. The oxygen donor is any 
compound capable of donating oxygen to (a) , such as 
dioxygen. 

25 In the process the concentration of (a) , (b) or (c) 

is typically from 10"® to 10"^M, preferably from 10"^ to 
10"*M. Typically the ratio of concentrations of (a) : (b) 
and/or (a): (c) is from 0.1:01 to 1:10/ preferably from 
1:0.5 to 1:2, or from 1:0.8 to 1:1.2. Generally the 

30 process is carried out at a temperature and/or pH at 

which the enzyme is functional, such as when the enzyme 
has at least 20%, 50%, 80% or more of peak activity. 
Typically the pH is from 3 to 11, such as 5 to 9 or 6 to 
8, preferably 7 to 7.8 or 7.4. Typically the temperature 

35 is 15 to 90X, such as 25 to 75"C or 30 to 60*C. In one 
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f ollowing: 

H2N-P450„„-TDGTSST-putidaredoxin reductase-TDGASSS- 
putidaredoxin-COOH/ 

H2N-P450e3:n-TDGTRPGPGPGPGPSST-putidaredoxin 
5 reductase-TDGASSS-putidaredoxin-COOH, 

H2N-P4 50e5„,-TDGTRPGPGPGPGPGPGPSST-putidaredoxin 
reductase-TDGASSS-putidaredoxin-COOH, 

H2N-putidaredoxin reductases-TDGASSS- 
putidaredoxin-PLEL-P450ca;n-COOH. 
10 However it is understood that the excluded E.Coll 

DH5a cells can be used to produce the libraries discussed 
below. 

A preferred cell (second type) is a cell which 
expresses : 

15 (a) (i) P450bm-3/ or a fragment thereof; or 

(ii) a naturally occurring homologue of 
P450bj^.3 or a fragment thereof; or 

(iii) a mutant P450bm.3, or a mutant 

homologue of thereof. 

20 The cell provided by the invention is typically a 

cell from the species mentioned above in which the 
nucleotide of the invention can be expressed. The cell 
may be a mutator cell. Such a cell is generally 
deficient in one or more of the primary DNA repair 

25 pathways (such as E.Coli pathways mutS, mutD or mutT, or 
their equivalents in another organism) , and thus has a 
high mutation rate. Simply culturing such cell leads to 
the DNA encoding (a) to become mutated. The cell may be 
of E.Coli XLl Red mutator strain. 

30 The cell of the invention may be in a substantially 

isolated form and/or substantially purified fonti/ in 
which case it will generally comprise (e.g. at least or 
about) 90%, such as (e.g. at least or about) 95%, 98% or 
99% of the cells or the mass (normally measured in terms 

35 of dry mass) of the preparation. 
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embodiment the process is carried out in the presence of 
a substance able to remove hydrogen peroxide by-product 
(e.g. a catalase) . 

Alternatively the process of the invention could be 
carried out in the presence of the enzyme, substrate and 
an oxygen atom donor, such as hydrogen peroxide or t- 
butylhydroperoxide. Thus, the process could be carried 
out using the peroxide shunt. 

Typically in the process at least 20 turnovers/min 
occur, such as at least 50, 100, 200, 300, 500 or more 
turnovers (turnover is measured as nanomoles of product 
formed per nanomole of enzyme) • 

The invention also provides several types of cells. 
The first type expresses : 

an enzyme which can be used in the process which in 
its naturally occurring form has an electron transfer 
reductase domain; or expresses 

(a) the mutant haem-containing enzyme which is used in 
the process of the invention; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin. 

The second type of cell expresses: 

(a) (i) P450^3„ or a fragment thereof; or 

(ii) a naturally occurring homologue of P450caa, or 
a fragment thereof; or 

(iii) a mutant of F450^^; or 

(iii) a polypeptide which has at least 70% amino 

acid homology with (i) or (ii) and optionally 
has any of the combination of mutations 
discussed herein; and 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin: 

excluding an E.Coli DH5a cell in which the only 
mutants of P450^,„ which are expressed are amongst the 
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The cell may be one which does not naturally 
express (a), (b) or (c) . The cell be one in which (a), 
(b) or (c) are expressed at a higher level than in the 
naturally occurring cell. (a) may originate from the 
same organism as (b) or (c) . 

In the cell (a), (b) and (c) may be expressed from 
the same vector, or may be expressed from different 
vectors. They may be expressed as three different 
polypeptides. Alternatively they may be expressed in the 
form of fusion proteins. Typically components (a), (b) 
and (c) are all present in the same fusion protein. 
Alternatively only two of the components, preferably (b) 
and (c) , may be present in the fusion protein. Typically 
the components are contiguous in the fusion protein and 
there is no linker peptide present. 

Alternatively a linker may be present between the 
components. The linker generally comprises amino acids 
that do not have bulky side chains and therefore do not 
obstruct the folding of the protein subunits. Preferably 
the amino acids in the linker are uncharged. Preferred 
amino acids in the linker are glycine, serine, alanine or 
threonine. In one embodiment the linker comprises the 
sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C. The linker is 
typically from at least 5 amino acids long, such as at 
least 10, 30 or 50 or more amino acids long. 

The first type of cell may be obtained by 
transforming or transfecting a host cell with a 
polynucleotide or vector of the invention. 

The mutant enzyme of the invention may be prepared 
by a process comprising cultivating the first type of 
cell under conditions to provide for expression of the 
said mutant enzyme, and optionally recovering the 
expressed mutant enzyme. 

The process of the invention may be carried out in 
the first type of cell of the invention or in the second 
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type of cell if it is able to oxidize the substrate/ or a 
medium containing it. Generally such a process comprises 
providing the substrate in the cell, allowing the 
substrate to be oxidized in the process of the invention, 
5 and optionally obtaining the oxidation product therefrom, 
e.g. by extraction. The substrate is typically provided 
in the cell by adding the substrate to the outside of the 
cell and allowing it to enter the cell. Alternatively the 
substrate could be synthesized in the cell from a 

10 precursor. 

The invention also provides a process for making a 
library of mutants of P450ca„ or mutants of a homologue of 
P450cam comprising contacting the second type of cell with 
a mutagen and/or when the cell is a mutator cell 

15 culturing the cell in conditions in which mutants are 
produced. The mutagen may be contacted with the cell 
prior to or during culturing of the cell. Thus the 
mutagen may be present during replication of the cell or 
replication of the genome of the cell. 

20 The mutagen generally causes random mutations in 

the polynucleotide sequence which encodes (a) . The 
mutagen is typically a chemical mutagen, such as 
nitrosomethyguanidine, methyl- or ethylmethane sulphonic 
acid, nitrite, hydroxylamine, DNA base analogues, and 

25 acridine dyes, such as proflavin. It is typically 

electromagnetic radiation, such as ultra-violet radiation 
at 260 nm (absorption maximum of DNA) and X-rays. It is 
typically ionising radiation. 

Typically the library will be in the form of cells 

30 which are derived from cells of the invention by 

mutagenesis and which cells comprise the mutant enzymes. 
Generally each cell will express only one particular 
mutant enzyme. The library typically comprises at least 
500 mutants, such as at least 1,000 or 5,000 mutants, 

35 preferably at least 10,000 different mutants. 
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The library typically comprises a random population 
of mutants. The library may undergo one or more rounds 
of selection whilst being produced and therefore may not 
comprise a random population. Between rounds of 
selection the cells in the library may be allowed to 
replicate, and they may also be contacted with a mutagen. 

A mutant can be selected from the library based on 
a particular property of the mutant. The property may 
comprise one or more of the following characteristics: 

(i) the ability to oxidize a particular substrate; 
optionally to a particular oxidation product or to a 
product with a particular activity. 

(ii) the ability to carry out the oxidation of substrate 
at an increased rate, 

(iii) a reduced oxidation activity towards a particular 
substrate, 

(iv) a reduction in the production of a particular 
substrate. 

Typically the activity of the product in (i) is 
blocking the action of an agent which is lethal to the 
cells of the library. This can be selected by growing 
the library in the presence of the agent. The agent is 
typically expressed within the cells of the library. 

The activity may be the binding of the product to a 
particular substance, such a protein. The substance is 
typically present in the cell of the library, and/or is 
typically a disease-causing or therapeutic target. An 
indicator which binds the substance is typically used to 
detect binding of the product to the substance. In one 
embodiment the indicator is able to bind the substance 
and has a property which changes upon binding, e.g. a 
colour change. Product which displaces the indicator 
from the substance can thus be detected by measuring 
changed in the property. 

The invention also provides a method of making a 
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library of oxidation products comprising providing a 
substrate to the library of mutant enzymes and allowing 
oxidation of the substrate. 

Products produced in the process of the invention, 
5 or identified, selected, made or designed using the 

library could be used in therapy or diagnosis. Thus the 
invention provides a method of treating a host suffering 
from a disease, which method comprises administering to 
the host a therapeutically effective amount of the 

10 product. The condition of a patient suffering from the 
disease and in need of the product can therefore be 
improved by administration of the product. The product 
can also be given as a prophylactic, typically to a host 
which is at risk from or susceptible to the disease. 

15 The invention provides the product for use in a 

method of treatment of the human or animal body by 
therapy. The invention also provides the product for use 
in a diagnostic method practiced on the human or animal 
body. The invention also provides use of the product in 

20 the manufacture of a medicament to treat a disease. 

The formulation of the product for use in 
preventing or treating infection by an organism will 
depend upon factors such as the nature of the product 
identified, whether a pharmaceutical or veterinary use is 

25 intended, etc. In order to be administered to a patient, 
the compound will be provided in the form of a 
pharmaceutical composition containing the product and a 
pharmaceutically acceptable carrier or diluent. Suitable 
carriers and diluents include isotonic saline solutions, 

30 for example phosphate-buffered saline. Typical oral 
dosage compositions include tablets, capsules, liquid 
solutions and liquid suspensions. For example it may be 
formulated for parenteral, intravenous, intramuscular, 
subcutaneous, transdermal or oral administration. 

35 The dose of product may be determined according to 
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various parameters, especially according to the substance 
used; the age, weight and condition of the patient to be 
treated; the route of administration; and the required 
regimen. A physician will be able to determine the 
5 required route of administration and dosage for any 

particular patient. A suitable dose may however be from 
0.1 to 100 mg/kg body weight such as 1 to 40 mg/kg body 
weight, for example, to be taken from 1 to 3 times daily. 
The invention is illustrated by the accompanying 

10 drawings which show the gas chromatography results for 
various oxidation reactions. In the drawings unless 
stated otherwise the y-axis shows mVolts and the x-axis 
shows Time (in minutes) . 
In the drawings : 

15 Figure 1 shows the oxidation of camphor by the 

C334A mutant of P450„^ (expressed from plasmid SGB++) . 
Lines A, B, C, D and E represent camphor turnover at 2, 
10, 20, 40 and 100 minutes. The 5.28 peaks is camphor, 
11.82 is 5-exo-hydroxycamphor, 7.45 is 5-ketocamphor and 

20 the 16.07 peak is the internal standard. 

Figure 2 shows R- and S- limonene with wild type 
P450b„.3, 

Figure 3 shows NADH consumption by wild type and 
mutant P4 500,^.3. 

25 Figure 4 shows the whole cell E. coli oxidation of 

a-pinene by the Y96F-F87W-V247L mutant expressed by 
plasmid pCWSGB+, 

The invention is also illustrated by the Examples: 
Example 1 

30 Expression of mutants for ±n vitro work. 

The P450cam enzymes were expressed using the vector 
PRH1091 (Baldwin, J.E., Blackburn, J.M., Heath, R.J., and 
Sutherland, J.D. Bioorg. Med. Chem. Letts., 1992, 2, 663- 
668.) which utilised the trc promoter (a fusion of the 

35 trp and Jac promoters) . This vector incorporates a strong 
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ribosome binding site (RBS) and the gene to be expressed 
is cloned using an Nde I site on the 5' end of the gene. 
We used Hind III as the cloning site at the 3' end of the 
camC gene. The procedure for protein expression is as 
5 follows: Cells are grown at 30**C until the ODgoonm reaches 
1.0 - 1.2, the temperature is increased to ST'^C and 
camphor added as a 1 M stock in ethanol to a final 
concentration of 1 mM. The culture is allowed to incubate 
at 37°C for another 6 hours. The P450ca™ protein is 

10 expressed to high levels in the cytoplasm and the cells 
take on a red to orange-red colour. 

We have also prepared a variant of pRH1091 (by PGR) 
which has a extra Xba 1 site between the RBS and the JVde 
I site. This is important because Nde I is not unique in 

15 M13, and this restriction site is also present in the 
reductase gene as well as the backbone of the pGLWll 
vector used for the in vivo system. Xha I is unique in 
the polylinker region of M13, but absent in the genes of 
all three proteins in the P450cani system and in the 

20 expression vectors. It therefore allows the camC gene to 
be moved between the mutagenic and expression vectors. 

The P450bm-3 enzyme from Bacillus megaterium was 
expressed using either the pGLWIl or pCW vectors. The 
recombinant plasmid with the P450bm_3 gene inserted into 

25 either of these vectors were transformed into E. coli 
strain DH5a and grown under ampicillin selection. A 
single colony was then grown at 30°C in LB media 
supplemented with ampicillin until the OD at 600 nm 
reached ca. 1, and protein expression was induced by 

30 adding IPTG from a 1 M stock to a final concentration of 
ImM. After 6-8 h, cells were harvested by centrifugation 
and the expression levels were high/ as indicated by an 
orange-red coloration of the cells. 
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How the mutants were made. 

Oligonucleotide-directed site-specific mutagenesis 
was carried out by the Kunkel method {Kunkel, T. A. Proc. 
Natl. Acad. Sci. USA 1985, 82, 4 88-492) using the Bio-Rad 
5 Mutagen kit and by PGR using the QuikChange kit from 
Stratagene. The recommended procedure for the Kunkel 
method is summarised as follows. An M13 mpl9 subclone of 
the camC gene encoding P450j.a„ was propagated in the E. 
coli strain CJ236. This strain has the dut"ung* phenotype 

10 and thus will tolerate the inclusion of uracil in place 
of thymine in DNA molecules. After three cycles of 
infection, uracil-containing single stranded (USS) M13 
DNA was readily isolated by phenol extraction of mature 
M13 phage particles excreted into the growth medium. The 

15 mutagenic oligonucloetide (or oligonucleotides) were 
phosphorylated with T4 polynucleotide kinase and then 
annealed to the USS template. The four nucleotides, DNA 
polymerase, DNA ligase, ATP and other chemical components 
were added and the second strand was synthesised in 

20 vitro. The double stranded form thus obtained was 

transformed into the dut+ ung+ E. coli strain MV1190, 
which should degrade the uracil-containing template 
strand and propagate the mutant strand synthesised in 
vitro. Plaques were picked and phages of possible mutants 

25 grown in E. coli strains MV1190 or TGI. The single- 
stranded DNA from these were sequenced to determine 
whether the mutagenesis reaction was successful. The 
mutagenic efficiency was 50 - 80%. The mutant camC gene 
is excised from the M13 subclone by restriction digest 

30 with Nde I and ^find III, and the fragment of appropriate 
size is ligated to the backbone of the expression vector 
prepared by a similar Nde I/Hind III digest. 

The QuikChange kit relies on the property of the Dpn 
I restriction enzyme which selectively cleaves methylated 
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DNA. The mutation is introduced by PCT using double 
stranded plasmid DNA, and hence no single stranded 
template preparations are necessary. The PGR reaction is 
carried out with two oligonucleotides, one of which binds 
to the coding strand and the other to the sense strand. 
Each oligonucleotide contains a short stretch of 
polynucleotide complementary to either side of the 
mutation site. After in vitro synthesis by PGR using 
non-methylated dNTP's, plasmid DNA with overlapping nicks 
in each strand were digested with Dpn I to remove the 
starting template selectively - plasmid DNA isolated from 
most E. coli strains contain methylated bases but the 
newly synthesised DNA do not have methylated bases. 
After the digest the DNA is transformed into 
supercompetent E, coli XLl Blue cells and propagated. 
The plasmid DNA from potential mutants which grow on agar 
plates under antibiotic selection were isolated and 
sequenced to confirm mutagenesis. The cells can then be 
used for protein expression once the entire sequence of 
the new mutant was confirmed to ensure that there were no 
spurious mutations. 

Multiple mutants were prepared either by further 
mutagenesis, also by the Kunkel method, or where the 
location of the sites in the sequence peinaits, simple 
cloning steps. There are two unique restriction sites 
within the camC gene which are absent from the expression 
vector. One is Sph I which spans residues 121 - 123, and 
the other is Sal I which spans residues 338 and 339. 
Therefore, all mutations at, for example, residues 87, 
96, 98, and 101 are readily combined with mutations at 
higher number residues by ligating appropriate fragments 
from restriction digests of mutant camC genes with Nde 
1/Sph I and Sph 1/Hind III and the backbone fragment from 
a Nde 1/Sph I digest of the expression vector. Mutations 
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at/ for example, 395 and 396 can be similarly 
incorporated by digests in which Sph 1 is replaced with 
Sal I. 

The rationale for introducing the unique Xba I site 
5 is now clear: many mutants with multiple mutations were 
prepared by the cloning procedure above. Without the Xba 
I site it would be impossible to clone the gene for these 
multiple mutants from the expression vector back into M13 
for further rounds of mutagenesis. Of course these 
10 problems could be overcome by doing mutagenesis by PGR, 
for example. 



Example 2 

Substrate oxidation protocol; In vitro reactions 



15 



20 



Component 
P450^3„ enzyme 
Putidaredoxin 
Putidaredoxin reductase 
Bovine liver catalase 
KCl 

Substrate 
NADH 



Final concentration 

1 pM 
10 ]M 
1 pM 

20 pg/ml 
200 mM 

Typically 1 mM 
250 - 400 pM 



* 50 mM Tris-HCl buffer pH 7.4 is added to make up 
25 the volxame. 

* Temperature controlled at 30*C, optional. 

* The NADH turnover rate could be determined by 
monitoring the absorbance at 340 nm with time. 

* Catalase does not catalyse the substrate 

30 oxidation reactions but rather it is present to remove 
any hydrogen peroxide by-product which could otherwise 
denature the P450ca„. 

The method can be increased in scale to, for 
example, 20 ml total incubation volume to allow 

35 purification of sufficient products by HPLC for 
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spectroscopic characterisation. Fresh substrate (1 mM) 
and NADH (1-2 mM) are added periodically, such as every 
20 minutes in a total reaction time of, typically, 3 
hours . 

Example 3 

The In vivo system 

The in vivo systems were expressed using the vector 
pGLWll, a derivative of the plasmid pKK223 (Brosius, J. 
and Holy, A. Proc. Natl, Acad. Sci. C7SA, 1984, 81, 6929- 
6933) . Expression is directed by the tac promoter and the 
vector incorporates a gene conferring resistance to the 
antibiotic ampicillin. 

Two systems were constructed. The first one 
expressed the electron transfer proteins putidaredoxin 
reductase {camA gene) and putidaredoxin (camB gene) as a 
fusion protein with a seven amino acid peptide linker, 
and the P4 50cam enzyme (camC gene) was expressed by the 
same vector but it was not fused to the electron 
proteins. The second system expressed the three proteins 
as separate entities in the E.Coli host. Both systems 
were catalytically competent for substrate oxidation in 
vivo. 

The general strategy was as follows. The genes for 
the three proteins were cloned using Eco RI and Hind III 
as flanking sites, with Eco RI at the 5' end. For both 
in vivo systems there are restriction sites between the 
genes, including between the reductase and redoxin genes 
in the fusion construct. These restriction sites were 
introduced by PGR, as detailed below. The first task, 
however, was to carry out a silent mutation to remove the 
Hind III site within the camA gene for the reductase. 
The AAGCTT Hind III recognition sequence in the camA gene 
was changed to AAGCCT, which is a silent mutation because 
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GCT and GCC both encode alanine. The gene was completely 
sequenced to ensure that there were no spurious 
mutations. 

1. The fusion protein system 

I. a Manipulation of the camA gene hy FCR 

For the camA gene the primer below was used at the 
5* end of the gene to introduce the Eco RI cloning site 
and to change the first codon from GTG to the strong 
start codon ATG. 

5 ' - GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC GCA 
AAC 

Eco RI RBS I -camA 

At the 3' end of camA the primer was designed such 
that 15 bases are complementary to nucleotide sequence of 
the last five amino acid residues of camA. The stop 
codon immediately after the GCC codon for the last amino 
acid was removed, and then part of a seven amino acid 
linker (Thr Asp Gly Gly Ser Ser Ser) which contained a 
Bam HI cloning site (GGATCC = Gly Ser) was introduced. 
The coding sequence was thus: 

5'- GAA CTG AGT AGT GCC ACT GAC GGA GGA TGC TCA TCG-3' 
camA - Thr Asp Gly Gly Ser 

I Bam HI I 

The primer sequence shown below is the reverse 
complement used for PCR: 

5'- CGA TGA GGA TGC TCC GTC AGT GGC ACT ACT CAG TTC-3' 

l.b hlanlpnlatlons of the caaB gene by PCR 

For the camB gene the primer at the 5' end 
incorporated the second half of the peptide linker 
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between the reductase and redoxin proteins, and the 
restriction site Bam HI for joining the two amplified 
genes together. 

5*- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT-3' 
5 Gly Ser Ser Ser |- camB 

\Bam HI I Start 

At the 3' end of camB the primer incorporates 12 
nucleotides complementary to the end of camB followed by 
10 the stop codon TAA, a 6 nucleotide spacer before the GGAG 
ribosome binding site. Xba I and Hind 111 sites were 
then added to allow cloning of the camC gene when 
required. The sequence of the coding strand was 
therefore : 

15 5'- CCC GAT AGG CAA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT TCA 

TCG- 3 » 

CamB - 1 stop BBS Xba I Hind 111 

The primer shown below is the reverse complement used for 
20 PGR: 

5*-CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA TTG CCT ATC 
GGG -3* 

25 I.c Preparation of the full fusion construct 

The camA and cajnB genes were amplified by the PGR 
using the primers described above. The new camA was 
digested with Eco RI and Bam HI, while the new CamB was 
digested with Bam HI and Hind 111. The pGLWll expression 

30 vector was digested with jE:co RI and Hind III. All three 
were purified by agarose gel electrophoresis and the 
three gel slices containing the separate fragments were 
excised from the gel and ligated together, and then 
transformed into E.Coli DH5a. Successful ligation of all 

35 the fragments were confirmed by a series of restriction 
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digestion experiments, especially the presence of the new 
and unique Xba I site. The entire sequence of the insert 
from the Eco RI site to the Hind III site was determined 
to ensure that all the sequences were correct. 
5 The new plasmid, named pSGB^' was transformed into 

E.Coli and expression of the reductase and redoxin 
proteins was induced by IPTG. When a purified P450can, 
enzyme was added to the cell-free extract, substrate 
oxidation was observed for a variety of substrates. 

10 When the camC gene is cloned into the pSGB^ plasmid 

using the Xba I and Hind 111 restriction sites, the new 
recombinant plasmid thus generated expresses the 
reductase and redoxin as a fusion protein and the P450cam 
enzyme as a operate entity both from the same mRNA 

15 molecule. This in vivo system is catalytically competent 
for terpene oxidation in whole cells. 

2 ■ The in vivo system with the protein expressed 
20 separately 

2 .a The basic strategy 

The starting point of the preparation of this in 
vivo system was the recombinant plasmid used to express 

25 the camA gene for putidaredoxin reductase. The camA gene 
was cloned into the pGLWll plasmid using the Eco RI and 
Bam HI restriction sites, with Eco RI being at the 5' end 
of the gene. Conveniently the polylinker region of the 
pGLWIl vector has a Hind 111 site downstream of the Bam 

30 HI site. The camB gene was therefore manipulated by PGR 
such that it can be cloned into pGLWll using the Bam HI 
and Hind III sites. This new plasmid expresses the 
reductase and redoxin as separate proteins. 

The camB gene was cloned into pUCllS by the Bam HI 
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and Hind III cloning sites to express putidaredoxin for 
our general in vitro substrate oxidation work. 
Therefore, the PGR primer at the 3' end of the camB gene 
was designed to introduce a ribosome binding site and the 
Xba 1 restriction site upstream of the Hind III site so 
that the camC gene can be inserted downstream of camB 
using the Xba I and Hind III sites. Therefore the three 
genes were cloned without fusion in the pGLWll expression 
vector and arranged in the order 5' -camA-camB-camC-3 ' ^ 
and each gene has its own RBS to initiate protein 
synthesis . 

2,b Manipulations of the camB gene 

We used the internal and unique restriction site Mlu 
I (recognition sequence ACGCGT) within the camB gene as 
the starting point so that the PGR product has a 
different size from the PGR template fragment. The 
primers were as follows: 

5'- TGA TCG ACG CGT GGG GAA GTG CTG-3' 

where the Mlu I site is in bold. 

The desired coding sequence at the 3' end of the 
camB gene was : 

5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC 
camB I stop 

TAT GCG CGA GTG GAG TCT AGA GTT CGA-3 • 
RBS Xba 1 

After the stop codon there is a 35 base spacer 
before the RBS which is used to initiate the synthesis of 
the P450ca„ enzyme. The Xba I cloning site is located 
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within the spacer between the RBS and the start codon 
(not in this primer) of the camC gene. The PGR primer 
used was the reverse complement of the sequence above. 
The PGR was carried out and the amplified fragment of the 
5 appropriate size was purified by agarose gel 
electrophoresis and the gel slice excised. 

One extra step was necessary to complete the 
construction of the new plasmid. The plasmid for the 
fusion protein in vivo system was digested with Mlu I and 

10 Hind III restriction enzymes, purified by agarose gel 
electrophoresis, and the gel slice for the small camB 
fragment excised. The pUCllS plasmid for camB expression 
was similarly digested, and the gel slice for the 
backbone was excised. By ligating the two fragments 

15 together we prepared a new pUG118-based plasmid which had 
an Xba I site followed by an Hind III site downstream of 
the stop codon of camB. This new plasmid was digested 
with the Mlu 1 and Xba I enzymes and the baclcbone was 
ligated with the new camB fragment described above to 

20 generate a plasmid with the following arrangement of the 
key components : 

..lac Promoter . .Bam HI.. camB grene. .spacer. .RBS. .XJba 
I. .Hind 111. . 



25 2.C Preparation of the in vivo system plasmid 

Once the modified camB with the Xha I and Hind III 
restriction sites and appropriate spacers were prepared, 
the in vivo system was constructed by cloning this into 
the pGLWll-based plasmid used to express the camA gene 
30 {reductase protein) using the Bam HI and Hind 111 sites. 
The new in vivo system vector has the following 
arrangement of the key components: 

. . tac Promoter . .£co IRI. .RBS. .camA gene. .spacer. .Bam 



SUBSTITUTE SHEET (RULE 26) 



wo 00/31273 PCT/GB99/03873 

-34- 

HI . .RBS. ,camB gene spacer RBS Xba I.. Hind III.* 



This new plasmid, named pSGB*, was transformed into 
E.Coli and expression of the reductase and redoxin 
5 proteins was induced by IPTG. When a purified P450cani 
enzyme was added to the cell-free extract, substrate 
oxidation was observed for a variety of substrates. 

When the camC gene is cloned into this pSGC* plasmid 
using the Xba I and Hind III restriction sites, the new 
10 recombinant plasmid thus generated will express the three 
proteins separately, each under the direction of its own 
RBS but from the same mRNA molecule. Thus constitutes 
the in vivo system used in the vast majority of our 
terpene oxidation work. 

15 

3. Introduction of an Xba I site into pRH1091 

This is the final step to enable the camC gene to be 
cloned into the in vivo systems by the two cloning sites 
Xbal and Hind III. The Xba I site was added by PGR of 
20 the entire pRH1091 plasmid using two primers. The 

presence of these two sites will also enable cloning of 
the camC gene into M13 since both Xba I and Hind III are 
unique in camC and M13. 

The primers shown below maintain the Hind 111 
25 cloning site AAGCTT: 

5'-TCA TCG AAG CTT GGC TGT TTT-3' 
Hind 111 |- vector 

30 At the other end the coding sequence desired was: 

5'-ACA ATT TCA CAC AGGA TCT AGA C CAT ATG TCA TCG AAG CTT TCA 
TCG- 3* 

Vector -|RBS Xba I Nde 1 Hind 111 
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This sequence maintained the Nde I and Hind III sites but 
the new Xba I site was introduced upstream of the Nde I 
site. The PGR primer used was the reverse complement of 
the desired sequence: 

5'-CGA TGA AAG CTT CGA TGA CAT ATG GTC T AGA TCCT GTG TGA AAT 
TGT-3 • 

The PGR product was then purified by agarose gel 
electrophoresis, digested with Hind III and circularised 
with T4 DNA ligase. Success of the PGR method was 
indicated by the presence of a new and unique Xba I site 
in plasmid DNA isolated from transf ormants . 

4. Cloning of camC into the in vivo systems 

All existing camC mutants were cut out of pRHlOSl- 
based expression plastids with Nde I and Hind III. The 
new vector is similarly cut with the same restriction 
enzymes and the camC gene cloned into this plasmid with 
T4 DNA ligase. This DNA is transformed into E.Coli JM109 
which then may be grown to express Pi5Q^^^. 

The camC gene is excised from the new vector using 
Xba 1 and Hind III restriction enzymes and cloned into 
either the in vivo vector systems or M13mpl9 for 
mutagenesis. 

5. In vivo expression and substrate turnover 

For protein expression, cells are grown in LBamp 
medium (tryptone 10 g/litre, yeast extract 5 g/litre, 
NaCl 10 g/litre, 50 pg/ml ampicillin) at 30''C until the 
ODeoonm reaches 1.0 - 1.2. IPTG (isopropyl-p-D- 
thiogalactopyranoside) was added to a final concentration 
of 1 v»M (from a 1 M stock in H2O) and the culture was 
incubated at 30 "^C overnight. 
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For simple screening the substrate can be added to 
culture and the incubation continued. However, due to 
impurities from the culture media the cells were 
generally washed twice with 0.5 vol. of buffer P, {KH2PO4 
5 6.4 g, K2HPO4.3H2O 25.8 g, H2O to 4 litres, pH 7.4) and 
resuspended in 0.25 vol. oxygen saturated buffer P 
containing 24 mM glucose. Substrate was added to 1 mM and 
the incubation continued at 30°C. The reaction was 
allowed to run for 24 hours with periodic additions of 

10 substrate and glucose. 

The reaction was analysed by extracting 1 ml of the 
incubation mixture with 250 ]il of ethylacetate . After 
centrifuging in a microcentrifuge at 13,000 g for 2 
minutes, 2 pi of the organic extract was injected onto a 

15 0.25 mm X 30 m DB-1 gas chromatography column in a Fisons 
GC 8000 series gas chromatograph. The samples were 
carried through the column using helium carrier gas and 
the compounds present were detected using a flame 
ionisation detector. 

20 A variety of temperature programmes were used for 

different substrate to resolve turnover products. 

Monoterpenes 



Injector temperature 
Detector temperature 
Oven temperature 



150 



25 



250 ^'C 



120 for 15 min - 200 
at 25 ^C/min, 200 for 1 



min. 



Sesqui terpenes 



30 



Injector temperature 
Detector temperature 
Oven temperature 



250 °C 



250 ^'C 



150 - 230 **C at 5 '^C/min, 
230 for 1 min. 



35 



Results for particular oxidation reactions are shown 
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in table 6 and in the Figures. 

Figure 1 shows the result of an oxidation reaction 
with camphor. The 5-ketocamphor arises from further 
oxidation of 5-exo-hydroxycamphor . As can be seen there 
5 is surprisingly little of the further oxidation occurring 
in the presence of camphor. 

Example 4 

A second In vivo expression system 

10 The cluster of genes for the expression of the three 

proteins of the P450can, system, as described in Example 3, 
were also expressed in whole E. coll cells using the pCW 
vector. This vector utilises two tac promoters arranged 
in line to increase protein expression. It has a RBS, 

15 and contains the gene conferring resistance to the 

antibiotic ampicillin (Barnes, H.J. Methods Enzymol. 
1996, 272, 3-14) . 

Both methods of expressing catalytically competent 
P450cam systems described in Example 3 were successful 

20 with the pCW vector. Thus the fusion system, where 

putidaredoxin reductase and putidaredoxin were expressed 
as a fusion protein with an oligopeptide linker, but the 
P450ca„i monooxygenase was expressed but not fused to the 
electron transfer proteins. The second system expressed 

25 all three proteins as separate entities in the same E. 
coll host. 

1. The fusion protein system 

A new plasmid was constructed by cloning the gene 
30 for the fusion of electron transfer proteins into pCW so 
that different P450can mutants could be introduced into 
the system by cloning. The 5' end oligonucleotide used 
for the PGR amplification of the cam A gene introduced 
not only the Eco RI site for cloning into the pGL Wll 
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vector but also a Nde I site which spans the ATG start 
codon of the gene (see Example 3, section la) , In the 
pCW vector there is a Nde I site positioned downstream of 
the RBS for cloning of the gene to be expressed. The pGL 
Wll vector containing the camA-camB fusion gene was 
digested with Nde I and Hind III, and the insert purified 
by agarose gel electrophoresis. The pCW vector system 
was also digested with these two enzymes and the 
linearised vector purified by the same method. The two 
fragments were ligated with DNA ligase to genera#^the 
new pCWSGBp plasmid based on the pCW vector and which 
expressed the fusion of the electron transfer proteins. 
The insert excised from the pGL Wll-based plasmid already 
contained a RBS for protein expression and an Xba I site 
just upstream of the Hind III site (see Example 3, 
section 2.c), so that the cam C gene encoding P450^^ 
mutants can be cloned using these two sites and expressed 
off the RBS. Therefore genes encoding the P450^^^ mutants 
can be excised from the modified pRH system and cloned 
into the new pCWSGBp plasmid using the Xba I and Hind 111 
sites. 

2. The three proteins expressed separately 

This system was generated in exactly the same way as 
for the fusion system. Thus the pSGB+ plasmid was 
digested with the Nde I and Hind III restriction enzymes 
and the insert cloned into the pCW vector. This new 
plasmid pCWSGB+ expressed putidaredoxin reductase and 
putidaredoxin as separate entities off the twin tac 
promoters of the pCW vector. The P450ca„ mutants were 
introduced into this vector using the Xba I and Hind III 
sites. 

3. In vivo expression and substrate turnover 
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The conditions described below were used for test 
purposes in shake flasks in a laboratory and were not 
optimised. Under proper fermenter conditions with higher 
expression and biomass the final yield of products will 
5 be much increased. 

E. coll DH5a cells harbouring either of the 
catalytically competent P4 50^,^ systems were grown from a 
single colony on an agar plate in 1 L of LBamp medium at 
30 °C until ODgoonm reached ca. 1. IPTG was added to 1 mM 

10 final concentration (from a 1 M stock in water) and the 

culture grown for a further 6 h. The final ODgoonm were in 
the range 2.0 - 2.5. Cells were harvested by 
centrifugation at 5000 g and washed once with 40 mM 
phosphate buffer, pH 7.4. The cell pellet was 

15 resuspended in 500 ml of 40 mM phosphate buffer, pH 7.4. 

Glucose was added as a 2 M stock to a final concentration 
of 100 mM, and 1 mL of the substrate (a-pinene or R- 
limonene) was added to start the reaction. The mixture 
was shaken in an open 2 L conical flask in an orbital 

20 incubator at 200 rpm. More glucose was added every 24 h 
(100 mM final concentration based on a 500 mL volume, 
from a 2 M stock) and more substrate (1 mL) was added 
every 12 h. The progress of the reaction was monitored 
by GC and the whole cell system was active for at least 5 

25 days at ambient temperatures and the minimum yield at the 
end of day 5, as assayed by extraction of the reaction 
medixim with chloroform and analysing by GC, was 100 mg of 
products without accounting for volatilisation of 
products into the atmosphere and condensation on the 

30 flask above the liquid level. In addition, no compounds 
arising from further hydroxylation at another carbon atom 
were observed when the substrate was present. 



SUBSTITUTE SHEET (RULE 26) 



wo 00A31273 



PCT/GB99/03873 



-40- 

Example 5 

In vitro and In vivo substrate oxidation bv PdSO^ ^ 
In a typical reaction in vitro (optional 30°C 
temperature control) the 1.5 mL incubation mixture 
5 contained 40 inM phosphate buffer, pH 8.0, 1 pM P450bm.3/ 
50 ug/mL of catalase, and terpene substrates were added 
as a 1 M stock: in ethanol to a final concentration of 2 
mM. NADPH was added typically to 400 ]M final 
concentration, and the rates of reaction could be 

10 monitored at 340 nm. After all the NADPH had been 

consumed, the mixture was extracted by vortexing with 0.5 
mL of chloroform, the phases separated by centrif ugation, 
and the organic phase could be analysed by GC using the 
programs described in Example 3. 

15 Catalase does not catalyse the substrate oxidation 

reaction but rather it is present to remove any hydrogen 
by-product which could otherwise denature the enzyme. The 
method can be increased in scale to, for example, 20 mL 
total incubation volume to allow purification of 

20 sufficient produces by HPLC for spectroscopic 

characterisation » Fresh substrate {1 roM) and NADPH (1-2 
mM) are added periodically, such as every 20 minutes in a 
total reaction time of, typically 3 hours. 

Since the P450b„.3 enzyme is catalytically self- 

25 sufficient, i.e., both the monooxygenase and electron 

transfer domains are in a single polypeptide, the enzyme 
as expressed in E. coli can be used for whole cell, in 
vivo substrate oxidations. The procedure described under 
Example 3 for in vivo substrate oxidation by the P450cam 

30 enzyme can also be used for the P450bm.3 enzyme. 
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(+)-Loiigifolene (•i-)-yalencene 




Isosesquicarene 



Table 2 
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Table 3. HYDROPATHY SCALE FOR AMINO ACID 
SIDE CHAINS 



Side Chain Hydropathy 



lie 


4.5 


Val 


4.2 


Leu 


3.8 


Phe 


2.8 


Cys 


2.5 


Met 


1.9 


Ala 


1.8 


Gly 


-0.4 


Thr 


-0.7 


Ser 


-0.8 


Trp 


-0.9 


Tyr 


-1.3 


Pro 


-1.6 


His 


-3.2 


Glu 


-3.5 


Gin 


-3.5 


Asp 


-3.5 


Asn 


-3.5 


Lys 


-3.9 


Arg 


-4.5 
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Table 4: P4S0.,^utants 
All mutants optionally contain the base mutation C334A. 
Single mutants; Y96A, Y96F, Y96L, Y96W, 



Double mutants: 



Y96A-P87A 


Y96F-F87A 


Y96P-V295A 


Y96L-F87A 


Y96L-A296L 


y96A-P87L 


Y96F-F87I 


Y96F-V29SL 


Y96L-F87L 


Y96L-A296F 


Y96A-F87W 


Y96F-F87L 


Y96F-V295I 


Y96L-F98W 


Y96L-V39eA 


Y96A-F98W 


Y96F-F87W 


Y96F-A296L 


Y96L-T101L 


Y96L-V396L 


Y96A-L244A 


Y96F-F98W 


Y96F-A296F 


Y96L-T101F 


Y96L-V396F 


y96A-V247A 


Y96F-T101L 


Y96P-I39SF 


Y96L-L244A 


Y96L-V396W 


Y96A-V247L 


Y96F-T10XF 


Y96F-I395G 


Y96L-L244F 




y96A-l395F 


Y96F-T185A 


Y96F-V396A 


Y96L-V247A 




Y96A-I395G 


Y96F-T185F 


Y96F-V396L 


Y96L-V247L 


Y96W-F87W 




Y96F-T185L 


Y96F-V396F 


Y96L-V247F 


Y96W-F98W 




Y96F-L244A 


Y96F-V396W 


Y96L-V247W 


Y96W*L244A 




Y96F-V247A 




Y96L-G248L 


Y96W-V247A 




Y96F-V247L 




Y96L-V295L 


Y96W-V396A 




Y96F-G248L 




Y96L-V295P 





Triple Mutants : 
Y96A-F87A-L244A 
Y96A-F87A-V247A 
Y96A-r87L-L244A 
Y96A-F87L-V247A 
Y96A-L244A-V247A 

Y96L-F87A-L244A 

Y96L-F87A-V247A 

Y96L-F87L-L244A 

Y96L-F87L-V247A 

Y96L-V247A-I395F 

Y96L-V247L-I395F 

Y96L-V247L-I39SG 

Y96L-L244A-V396L 

^96L-L244A-V396F 

Y96L-L244A-V396W 

Y96L-I.244F-V396A 



Y96L-V247A-V396L 
Y96L-V247A-V396F 
Y9 6L - V2 4 7 A- V3 9 6W 
Y96L-V247F-V396A 

Y96F-F87A-L244A 
Y96F-F87A-V247A 

796F-F87A-V247L 
Y96F-F87A-I395F 
Y96F-F87A-'I395G 
Y96F-Fa7L-V247A 
Y96F-F87L-V247L 
Y96F-F87L-I395F 
Y96F-F87W-T18SA 
Y96F-F87W-T185F 
Y96F-F87W-T1B5L 
Y96F-F87W-L244F 



Y96F-F87W-V2 47A 

Y96F-F87W-V247L 

Y96F-F87W-V247F 

Y96P-F87W-V295L 

Y96F-F87W-A296L 

Y96F-F87W-V396A 

Y96F-F87W-V396L 

Y96F-V247F-V396A 

Y96F-L244A-V3961i 

Y96F-L244A-V396F 

Y96F-L244A-V396W 

Y96F-L244F-V396A 

Y96F-V247A-V3961^ 

Y96F-V247A-V396F 

Y96F-V247A-V396W 

Y96W-F87W-F98W 
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acg act gaa acc 
Thr Thr Glu Thr 
1 

cat gtg cca gag 
His Val Pro Glu 
20 

aat ctg tct gcc 
Asn Leu Ser Ala 
35 

aac gta cog gat 
Asn Val Pro Asp 
50 

gcc act cgc ggc 
Ala Thr Arg Gly 
65 

ttt tec age gag 
Phe Ser Ser Glu 

gac ttc att ccc 
Asp Phe He Pro 
100 

gcg ctg gcc aac 
Ala Leu Ala Asn 
115 

aac egg ate cag 
Asn Arg He Gin 
130 

caa gga cag tgc 
Gin Gly Gin Cys 
145 



ata caa age aac 
He Gin Ser Asn 

5 

cac ctg gta ttc 
His Leu Val Phe 

ggc gtg cag gag 
Gly Val Gin Glu 
40 

ctg gtg tgg act 
Leu Val Trp Thr 
55 

caa ctg ate cgt 
Gin Leu He Arg 
70 

tgc ceg ttc ate 
Cys Pro Phe He 
85 

acc teg atg gat 
Thr Ser Met Asp 

caa gtg gtt ggc 
Gin Val Val Gly 
120 

gag ctg gcc tgc 
Glu Leu Ala Cys 
135 

aac ttc acc gag 
Asn Phe Thr Glu 
150 



gcc aat ctt gcc cct 
Ala Asn Leu Ala Pro 
10 

gac ttc gac atg tac 
Asp Phe Asp Met Tyr 
25 

gee tgg gca gtt ctg 
Ala Trp Ala Val Leu 
45 

cgc tgc aac ggc gga 
Arg Cys Asn Gly Gly 
60 

gag gcc tat gaa gat 
Glu Ala Tyr Glu Asp 
75 

cct cgt gaa gcc ggc 
Pro Arg Glu Ala Gly 
90 

ceg ccc gag cag cgc 
Pro Pro Glu Gin Arg 
105 

atg ceg gtg gtg gat 
Met Pro Val Val Asp 

125 

teg ctg ate gag age 
Ser Leu He Glu Ser 
140 

gac tac gcc gaa ccc 
Asp Tyr Ala Glu Pro 
155 



ctg cca ccc 48 

Leu Pro Pro 
15 

aat ceg teg 96 

Asn Pro Ser 
30 

caa gaa tea 144 

Gin Glu Ser 

cac tgg ate 192 
His Trp He 

tac cgc cac 240 
Tyr Arg His 
80 

gaa gcc tac 288 
Glu Ala Tyr 
95 

cag ttt cgt 336 
Gin Phe Arg 
110 

aag ctg gag 384 
Lys Leu Glu 

ctg cgc ceg 432 
Leu Arg Pro 

ttc ceg ata 480 
Phe Pro He 
160 
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cgc ate ttc atg ctg etc 
Arg lie Phe Met Leu Leu 
165 

ttg aaa tac eta acg gat 
Leu Lys Tyr Leu Thr Asp 
180 

tte gca gag gee aag gag 
Phe Ala Glu Ala Lys Glu 
195 

gag caa cgc agg cag aag 
Glu Gin Arg Arg Gin Lys 
210 

aac ggc cag gtc aat ggg 
Asn Gly Gin Val Asn Gly 
225 230 
atg tgt ggc ctg tta ctg 
Met Cys Gly Leu Leu Leu 
245 

etc age ttc age atg gag 
Leu Ser Phe Ser Met Glu 
260 

gag ctg ate gag cgt cec 
Glu Leu lie Glu Arg Pro 
275 

etc egg cgc ttc teg ctg 
Leu Arg Arg Phe Ser Leu 
290 

tac gag ttt cat ggc gtg 
Tyr Glu Phe His Gly Val 
305 310 
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gca ggt eta ccg gaa gaa 
Ala Gly Leu Pro Glu Glu 
170 

cag atg ace cgt ccg gat 
Gin Met Thr Arg Pro Asp 
185 

gcg etc tac gae tat ctg 
Ala Leu Tyr Asp Tyr Leu 
200 

ccg gga ace gae get ate 
Pro Gly Thr Asp Ala lie 
215 220 
cga ccg ate ace agt gae 
Arg Pro lie Thr Ser Asp 
235 

gtc ggc ggc ctg gat acg 
Val Gly Gly Leu Asp Thr 
250 

ttc ctg gcc aaa age ccg 
Phe Leu Ala Lys Ser Pro 
265 

gag cgt att eca gee get 
Glu Arg lie Pro Ala Ala 
280 

gtt gcc gat ggc cgc ate 
Val Ala Asp Gly Arg He 
295 300 
caa ctg aag aaa ggt gae 
Gin Leu Lys Lys Gly Asp 
315 
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gat ate ccg cac 528 
Asp He Pro His 
175 

ggc age atg ace 576 
Gly Ser Met Thr 
190 

ata ccg ate ate 624 
He Pro He He 
205 

age ate gtt gcc 672 
Ser He Val Ala 

gaa gcc aag agg 720 
Glu Ala Lys Arg 
240 

gtg gtc aat tte 768 
Val Val Asn Phe 

255 

gag cat cgc cag 816 
Glu His Arg Gin 
270 

tgc gag gaa eta 864 
Cys Glu Glu Leu 
285 

etc ace tec gat 912 
Leu Thr Ser Asp 

cag ate ccg eta 960 
Gin He Leu Leu 
320 
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ccg cag atg ctg tct 
Pro Gin Met Leu Ser 
325 

cac gtc gac ttc agt 
His Val Asp Phe Ser 
340 

ggc age cat ctg tgc 
Gly Ser His Leu Cys 
355 

gtc ace etc aag gaa 
Val Thr Leu Lys Glu 
370 

ccg ggt gcc cag att 
Pro Gly Ala Gin He 
385 

gca etc cct ctg gtc 
Ala Leu Pro Leu Val 
405 



.50 

ggc ctg gat gag 
Gly Leu Asp Glu 

cgc caa aag gtt 
Arg Glu Lys Val 
345 

ctt ggc cag cac 
Leu Gly Gin His 
360 

tgg ctg acc agg 
Trp Leu Thr Arg 
375 

cag cac aag age 
Gin His Lys Ser 
390 

tgg gat ccg gcg 
Trp Asp Pro Ala 



cgc gaa aac gcc tgc 
Arg Glu Asn Ala Cys 
330 

tea cac ace acc ttt 
Ser His Thr Thr Phe 
350 

ctg gcc cgc egg gaa 
Leu Ala Arg Arg Glu 
365 

att cct gac ttc tec 
He Pro Asp Phe Ser 
380 

ggc ate gtc age ggc 
Gly He Val Ser Gly 
395 

act acc aaa gcg gta 
Thr Thr Lys Ala Val 
410 414 
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ccg atg 1008 

Pro Met 

335 

ggc cac 1056 
Gly His 

ate ate 1104 
He He 

att gcc 1152 
He Ala 

gtg cag 1200 
Val Gin 
400 

1242 
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V / 1 31/11 

2 ? r?^ ^f* ^''^^ ^^^'^ ctt aaa aat tta ccg tta 

Met Tkr He Lya Glu Met Pro Gin Pro Lye Thr Phe Gly Qlu Leu Lya Asn Leu Pro Leu 

61 / 21 91 / 31 

tta aac aca gat aaa ccg gtt caa get ttg atg aaa att gcg gat gaa tta gga gaa ate 

^^f'.J^^ ^""^ ^^"^ Ala Aop Glu Leu Gly Glu He 

"1 151 / 51 

Itl f** 2?^ ^''^ ^f*" ''^^ *=9^° ^""^ cfft eta att aaa 

Phe Lys Phe Glu Ala Pro Gly Arg Val Thr Arg Tyr Leu Ser Ser Qla Aig Leu lie LyS 
iBl / 211 / 71 

^f"" 2?* ^'^^^ ^'^^^ ^"^^ *** ctt aaa ttt gta egt 

Glu Ala Cya Aap Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Ara 
241 / 81 271 / 91 

l^^ 2?* ^^"^ acg cat gaa aaa aat tgg aaa aaa gcg 

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lya Asn Trp Lys Lys Ala 
301 / 101 331 / 111 

«f ^ f*^ f '^^'^ "Src ttc agt cag cag gca atg aaa ggc tat cat gcg atg atg 

His Asn He Leu Leu Pro Ser Phe Ser Gla Gin Ala Met Lye Gly Tyr His Ala Met Met 
361 ./ 121 391 / 131 

gtc gat ate gcc gtg cag ctt gtt oaa aag tgg gag egt eta aat gca gat gag cat att 
Val Asp Ala Val Gin Leu Val Gin Lys ^^^Gl^ Arg Leu Aan Ala Asp GlS Hia lie 

fr^f S""^ Hf* ^^"^ *=^^ Srat aca att ggt ctt tge gge ttt aac 

Glu Val Pro Glu Asp Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn 
481 / 161 511 / 171 

tat cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca agt ata etc cot 
Tyr Arg Phe Asa Ser Phe Tyr Arg Asp Gla Pro His Pro Phe He Thr Ser Met Val A» 
541 / 181 571 / 15JL 

gca ctg gat gaa gca atg aac aag ctg cag cga gca aat cca gac gac cca get tat gat 
Ala Leu Asp Glu Ala Met Asa Lys Leu Gla Arg Ala Asa Pro Asp isp Pro Ala Tyr Isp 
601 / 201 €31 / 211 

gaa aac aag cgc cag ttt caa gaa gat ate aag gtg atg aac gac eta gta aat aaa att 
Glu Asn Lys Arg Gin Phe Gin Glu Asp He LyI Val Me? Asn Asp Leu Val Asp Lys He 
661 / 221 691 / 231 

att gca gat cgc aaa gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac 
He Ala Asp tog Lys Ala Ser Gly Glu Gla Ser Asp Asp Leu leu Thr His Met Leu Asa 
721 / 241 751 / 251 

gga aaa gat cea gaa aeg ggt gag ccg ctt gat gac gag aac att cgc tat caa att att 
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp GlS Asa He Alg Tyr Gla He He 
781 / 261 811 / 271 

fi^Z^ 11"^ ^^'^ ctt tta tea ttt gcg ctg tat ttc 

Thr Ph« Leu He Ala Gly His Glu Thr Thr Ser Gly Leu Leu Ser Phe All lei Tyr Phe 
B4J. / 281 871 / 291 

5^? S*^? ?** f*^ 9ca gca gaa gaa gca gca cga gtt eta gta 

^^%^f'' ^^"^ Ala Ala Glu Glu Ala Ala lig Val LeS v3 

901 / 301 931 / 311 

gat cct get cca age tac aaa caa gtc aaa oag ctt aaa tat gtc ggc atg gtc tta aac 
Asp Pro Val Pro Ser Tyr Lys Gla Val Lys Gla Leu Lys Tyr Val Gly Met Val Leu Asa 
961 / 321 991 / 331 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca aaa gaa gat acg 
/* ^«",/5Sr Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lya Glu Lp Thr 

1021 / 341 1051 / 351 *^ 

gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa eta atg gtt ctg att cct cag 

y?i,^*/ ^^y^^}Y Tyr Pro Leu Glu Lys Gly Asp Glu Leu Met Val Leu lie Pro Gla 

1081 / 361 1111 / 373^ 

111 III fo*" f" Su* ^*^Sr gaa gag ttc egt cca gag egt ttt 

Leu His Arg Asp y^a ^ 

1141 / 381 1171 / 391 

gaa aat cca agt gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt eao egt oco 

^"'^n^'''' '^'^ ^» Asa Gly III tig HI 

1201 / 401 1231 / 411 

tgt ate ggt cag cag ttc get ctt cat gaa gca acg ctg gta ctt ggt atg atg eta aaa 
Cys lis ain Gla Pha Ala Leu His Glu Ala Thr Leu Val Leu Gly Met Mel LeS "« 
1261 / 421 1291 / 431 

III III !-« Ill S?* S?^ ""^^ Srat att aaa gaa act tta acg tta 

/ ^^"^ ^^"^ Asp He Lys Glu Thr Leu Thr Leu 

1321 / 441 1351 / 45J 

aaa cct gaa ggc ttt gtg gta aaa gca aaa teg aaa aaa att ecg ett ggc ggt att cct 
Lys Pro Glu Gly Phe Val Val Lye Ala Lys Ser Lys Lys He Pro Leu Gly Gly He Pro 
1381 / 461 1411 / 471 



SUBSTITUTE SHEET (RULE 26) 



wo 00/31273 



PCT/GB99/03873 



cat gaa gac tat ctg tat caa gaa gag ctt 
His Glu Asp Tyr Leu Tyr Oln 6lu Qlu Leu 
2881 / 96X 

ctt cat acc get ttt tct cgc atg cca aat 
Leu Hi a Thr Ala Phe Ser Arg Met Pro Aan 
2941 / 981 

gaa caa gac ggc aag aaa ttg att gaa ctt 
Glu Gin Asp Gly Lys Lys Leu Zla Glu Leu 
3001 / 1001 

gga gac gga age caa atg gca cct gcc gtt 
Gly Aap Gly Ser Gin Met Ala Pro Ala Val 
3061 / 1021 

gtt cac caa gtg agt gaa gca gac get cgc 
Val His Gin Val Ser Glu Ala Asp Ala Arg 
3121 / 1041 

ega tac gca aaa gac gtg tgg get ggg taa 
Arg Tyr Ala Lys Asp Val Trp Ala Gly OCH 
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gaa aac gcc caa age gaa ggc ate att acg 
Glu Asn Ala Gin Ser Glu Gly lie lie Thr 
2911 / 971 

cag ceg aaa aea tac gtt cag cac gta atg 
Gin Pro Lys Thr Tyr Val Gin His Val Met 
2971 / 991 

ctt gat caa gga gcg cac tte tat att tgc 
Leu Asp Gin Gly Ala His Phe Tyr Zle Cys 
3031 / 1011 

gaa gca acg ctt atg aaa age tat get gac 
Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp 
3091 / 1031 

tta tgg ctg cag cag eta gaa gaa aaa ggc 
Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
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tea cct age act gaa cag 

Ser Pro Ser Thr Glu Gin 

1441 / 481 

acg ccg ctg ctt gtg eta 

Thr Pro Leu Leu Val Leu 

1501 / 501 

tta gca gat att gca atg 

Leu Ala Asp lie Ala Met 

1561 / 521 

gcc gga aat ctt ccg cgc 

Ala 61y Asn Leu Pro Arg 

1621 / 541 

ccg cct gat aac gca aag 

Pro Pro Aap Asn Ala Lys 

1681 / 561 

aaa ggc gtt cgc tac tec 

Lys Gly Val Arg Tyr Ser 

1741 / 581 

aaa gtg cct get ttt ate 

Lys Val Pro Ala Phe lie 

1801 / 601 

cgc ggt gaa gca gat gca 

Arg Gly Glu Ala Asp Ala 

1861 / 621 

atg tgg agt gac gta gca 

Met Trp Ser Asp Val Ala 

1921 / 641 

tct act ctt tea ctt caa 

Ser Thr Leu Ser Leu Gin 

1931 / 661 

ggt gcg ttt tea acg aac 

Gly Ala Phe Ser Thr Asn 

2041 / 681 

age acg cga cat ctt gaa 

Ser Thr Arg His Leu Glu 

2101 / 701 

tta ggt gtt att cct cgc 

Leu Gly Val lie Pro Arg 

2161 / 721 

eta gat gca tea cag caa 

Leu Asp Ala Ser Gin Gin 

2221 / 741 

etc get aaa aca gta tec 

Leu Ala Lys Thr Val Ser 

2281 / 761 

acg cgc acg cag ctt cgc 

Thr Arg Thr Gin Leu Arg 

2341 / 781 

ctt gaa gcc ttg ctt gaa 

Leu Glu Ala Leu Leu Glu 

2401 / 801 

atg ctt gaa ctg ctt gaa 

Mat Leu Glu Leu Leu Glu 

2461 / 821 

ctt ctg cca age ata egc 

Leu Leu Pro Ser lie Arg 

2521 / 841 

aaa caa gca age ate acg 

Lys Gin Ala Ser lie Thr 

2581 / 861 

tat aaa gga att gcg teg 

Tyr Lys Gly lie Ala Ser 

2641 / 881 

ttt att tec aca ccg cag 

Phe lie Ser Thr Pro Gin 

2701 / 901 

atg gtc gga ccg gga aca 

Met Val Gly Pro Gly Thr 

2761 / 921 

eta aaa gaa caa gga cag 

Leu Lys Glu Gin Gly Gin 

2821 / 941 
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gcc 


aaa 


aaa 


gca 




Axa 


Ijya 


Lys 


vax 










X* / X 


tac 


ggt 


wcn 


aat 


atg 


Tyr 


wxy 


s>ex^ 


Asn 


Met 










1531 


age 


aaa 


gga 


ttt 


gca 




Lys 


v»xy 


Phe 


Ala 










1591 


gaa 


gga 


get 


gta 


tta 


Glu 


«xy 


Ala 


Val 


Leu 










1651 




ttt 


gtc 


gac 


tgg 






vax 


Asp Trp 










1711 


gca 


b w w 


gga 


tgc 


ggc 




rue 


uxy 


Cys 


Gly 










1771 


gat 


gaa 


acg 


ctt 


gee 


ASp 


uxu 


Thr 


Leu 


Ala 










1831 


age 


gac 


gac 


ttt 


gaa 


ber 


Asp 


Asp 


Phe 


Glu 










1891 


gcc 


tac 


ttt 


aac 


etc 


AX a 


Tyr 


rAe 


Asn 


Leu 










1951 


^ ^ ^ 

www 


gtc 


gac 


age 


gcc 


irlie 


vax 


ASp 


Ser 


Ala 










2011 


gtc 


gta 


gca 


age 


aaa 




VttX 


AX a 


Ser 


Lys 










2071 


att 


gaa 


Ctt 


cca 


aaa 




\aXu 


Leu 


Pro Lys 










2131 


aae 




gaa 


gga 


ata 


Asn 


Tyr 


Glu 


Gly 


lie 










2191 


ate 






gaa 


gca 


lie 


iixg 


Leu 


Glu 


Ala 










2251 






gag 


ctt 


ctg 


Val 


Glu 


Glu 


Leu 


Leu 










2311 


gca 


atg 


get 


get 


aaa 


AX a 


new 


AX a 


Ala 


Lys 










2371 


aag 


caa 


gcc 


tac 


aaa 


Lys 


laXu 


Axa 


Tyr 


Lys 










2431 


aaa 


tac 


ccg 


gcg 


tgt 


Lya 


Tyr 




Ala Cys 










2491 


ceg 


cgc 


tac 


tac 


teg 




Arg 


Tyr 


Tyr 


Ser 










2551 


gtc 


age 


gtt 


gtc 


tea 


Vox 


Ser 


Val 


Val 


Ser 










2611 


aac 


tat 


Ctt 


gcc 


gag 


Asn 


Tyr 


Leu 


Ala 


Glu 








2671 


tea 


gaa 


ttt 


acg 


ctg 


Ser 


Glu 


Phe 


Thr 


Leu 










2731 


ggc 


gtc 


gcg 


ccg 


ttt 


Gly 


Val 


Ala 


Pro 


Phe 








2791 


tea 


ctt 


gga 


gaa 


gca 


Ser 


Leu 


Gly 


Glu 


Ala 



2851 



cgc aaa 


aag 


gca 


gaa 


Arg Lys 


Lys 


&1 A 

Axa 


uxu 


/ 


49X 






gga aca 


get 


gaa 


gga 


Gly Thr 


AX a 


IsXU 


wxy 


/ 


3XX 






ccg cag 


gtc 


gca 


acg 


Pro Gin 


Val 


Ala 




/ 


531 






att gta 


acg 


gcg 


tct 


Tl Vtti 

xxe vax 


Thr 


Ala 






551 






tta gac 


caa 


gcg 


ccc 


ijeu Asp 


Gin 


Ala 


C A«> 


/ 
/ 


571 






gat aaa 


aac 


tgg 


get 


Asp Lys 


Asn Trp 


Axa 




S91 






get aaa 


ggg 


gca 


gaa 


Ala Lys 


Gly Ala 


Glu 


/ 


611 






ggc aca 


tat 


gaa 


gaa 


Gxy Tnr 


Tyr Glu 


Glu 


/ 


631 






gac att 


gaa 


aae 


agt 


Asp lie 


Glu 


Asn 


Ser 


/ 


651 






gcg gat 


atg 


ccg 


ctt 


Ala Asp 


Met 


Pro 


T All 

iieu 


/ 


671 






gaa ctt 


caa 


cag 


cca 


Glu Leu 


Gin 


Gin 


Pro 


/ 


691 






gaa get 


tct 


tat 


eaa 


Glu Ala 


Ser Tyr 


Gin 




711 






gta aae 


cgt 


gta 


aca 


Vo 1 & an 

vax Aoxx 


Arg Val 


Thr 


/ 


731 






gaa gaa 


gaa 


aaa 


tta 


uxu uxu 


Glu Lys 


Leu 


J 
/ 


751 






caa tac 


gtg 


gag 


ctt 


Gin Tyr 


Val 


Glu 


Leu 




771 






acg gtc 


tgc 


ccg 


eeg 


Thr Val 


Cys 


Pro 


Pro 


/ 


791 






gaa caa 


gtg 


ctg 


gca 


Glu Gin 


Val 


Leu 


Ala 




811 






gaa atg 


aaa 


ttc 


age 


Qxu nee 


Lys 


Phe 


ser 




831 






att tct 


tea 


tea 


eet 


xxe s>er 


Ser 


Ser 


rro 


/ 


851 






gga gaa 


gcg 


tgg 


age 


mm 

Gly uxu 


Ala 


Trp 


Ser 




871 






ctg caa 


gaa 


gga 


gat 


Leu Gin 


Glu Gly 


ASp 


/ 


891 






cea aaa 


gac 


cct 


gaa 


Pro Lys 


Asp 


Pro 


Glu 


/ 


911 






aga ggc 


ttt 


gtg 


cag 


Arg Gly 


Phe 


Val 


Gin 


/ 


931 






cat tta 


tac 


ttc 


ggc 


His Leu 


Tyr 


Phe 


Gly 


/ 


951 
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aac 
Aon 


get 
Ala 


cat 
Eia 


aat 
Asn 


acg 
Thr 


gcg 
Ala 


cgt 
Arg 


gat 
Asp 


ctt 
Leu 


gat 
Asp 


tea 
Ser 


cac 
His 


tat 
Tyr 


aac 
Aan 


ggt 
Gly 


eat 
His 


get 
Ala 


gat 
Asp 


gaa 
Glu 


gta 
Val 


act 
Thr 


acg 
Thr 


tat 
Tyr 


caa 
Oln 


aac 
Asn 


ate 
He 


get 
Ala 


gae 
Asp 


tgg 
Trp 


cgt 
Arg 


gaa 
Glu 


cat 
Bis 


gaa 

Glu 


gat 
Asp 


aat 
Asn 


aaa 
Lys 


gcg 
Ala 


aaa 

Lys 


atg 

Met 


eac 
His 


ggc 

Gly 


agt 
Ser 


gca 
Ala 


cga 
Arg 


gaa 
Glu 


gga 
Gly 


gat 
Asp 


eat 
His 


gea 
Ala 


agg 

Arg 


tte 
Phe 


ggc 

Gly 


get 
Ala 


cat 
His 


ttg 
Leu 


cea 
Pro 


eaa 
Gin 


gat 
Asp 


cct 
Pro 


gtt 
Val 


eat 
His 


aaa 

Lys 


gta 

Val 


gag 
Glu 


aaa 
Lys 


cgt 
Arg 


tta 
Leu 


aca 
Thr 


gaa 
Glu 


ttt 
Phe 


ate 
He 


gcc 
Ala 


cgt 
Arg 


gtc 
Val 


gat 
Asp 


gaa 

Glu 


gga 

Gly 


tat 
Tyr 


gga 
Gly 


gaa 
Glu 


acg 
Thr 


att 

He 


Thr 


tOG 

cyo 


acg 
Thr 


ceg 
Pro 


ctt 
Leu 


ate 
He 


gcg 

Ala 


cgc 
Arg 


aaa 
Lys 


cag 
Gin 


tgc 
Cya 


cgt 
Arg 


tea 
Ser 


cct 
Pro 
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-54- 
CLAIMS 

1. Process for oxidising a substrate which is an 
acyclic or cyclic terpene, or a cycloalkene; or a 
substituted derivative thereof, which process comprises 

5 oxidising said compound with a mutant haem-containing 
enzyme, the mutant comprising the substitution of an 
amino acid in the active site by an amino acid with a 
less polar side-chain. 

2. Process according to claim 1 in which the 

10 enzyme is a mutant of P450can, or P450bm.3. or a mutant of a 
naturally occurring homologue of either of these enzymes. 

3. Process according to claim 2 in which the 
enzyme is one in which amino acid 47 and/or 51 of 
P450bm-3/ or amino acid 96 of P450^^^, or the equivalent 

15 amino acid{s) in a said homologue, have been changed to 
an amino acid with a less polar side-chain. 

4. Process according to any one of claims 1, 2 or 
3 in which there are one or more other amino acid 
substitutions in the active site. 

20 5. Process according to any one of the preceding 

claims in which the enzyme is (i) P450cain and comprises 
one or more of the following mutations: F87W, F87I, F87L, 
T185L, T185F, V247A, V247L or F87A-I395F; or (ii) P450b„.3 
and comprises the mutation R47L-Y51F. 

25 6. An enzyme as defined in claim 4 or 5 excluding 

mutants of P450eam which only have the mutations F87A- 
Y96G-F193A, F87A-Y96G-F193A-C334A, or T101M-T185F-V247M. 

7. A polynucleotide which comprises a sequence 
which encodes an enzyme as defined in claim 6. 

30 8. A cell which expresses: 

(i) an enzyme as defined in any one of claims 2 to 6 
which in its naturally occurring form has an electron 
transfer reductase domain, or 

(ii) (a) an enzyme as defined in any one of claims 1 to 
35 5; 
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(b) an electron transfer reductase; and 

(c) an electron transfer redoxin. 
9. A cell which expresses: 

(a) (i) P450cam^ or a fragment thereof; or 

5 (ii) a naturally occurring homologue of P450c«n 

or a fragment thereof; or 
(iii)a mutant P450cam/ or a mutant 
homologue of thereof 

as defined in any one of claims 2 to 5; or 
10 (iv) a P450c^^ which has at least 70% amino acid 

homology with (i) , (ii) or (iii)and 
optionally has the mutations defined in 
any one of claims 3 to 5; and 

(b) an electron transfer reductase; and 
15 (c) an electron transfer redoxin: 

excluding an E.Coli DH5a cell in which the only 
mutants of P450cam which are expressed are amongst the 
following: 

H2N-P450cam-TDGTSST-putidaredoxin reductase-TDGASSS- 
20 putidaredoxin-COOH, 

H2N-P450ea„,-TDGTRPGPGPGPGPSST-putidaredoxin 
reductase-TDGASSS-putidaredoxin-COOH, 

H2N-P450ca„"-TDGTRPGPGPGPGPGPGPSST-putidaredoxin 
reductase-TDGASSS-putidaredoxin-COOH, 
25 HzN-putidaredoxin reductase-TDGASSS-putidaredoxin- 

PLEL-P450cam-COOH. 

10. A cell according to claim 8 or 9 in which (a)/ 
(b) and (c) or (b) and (c) are expressed together in the 
same fusion protein. 
30 11. A cell according to any one of claims 8 to 10 

in which: 

(b) is putidaretoxin reductase or a fragment 
thereof; and/or 

(c) is putidaretoxin or a fragment thereof. 

35 
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12. A cell which expresses: 

(a) (i) P450bm-3/ or a fragment thereof; or 

(ii) a naturally occurring homologue of P450bm-3 
or a fragment thereof; or 
5 (iii)a mutant P450b„.3, or a mutant 

homologue of thereof 

as defined in any one of claims 2 to 5. 

13. A process according to any one of claims 1 to 5 
in which the compound is oxidised in a cell according to 

10 any one of claims 8 to 12 

14. A process for making a library of mutants of 
P450cani or P450bm-3< or mutants of a homologue of either of 
these enzymes comprising contacting a cell according to 
any one of claims 9, and 10 and 11 when dependent on 

15 claim 9, or according to claim 12 or an E.Coll DH5a cell 

as defined in claim 9; 

with a mutagen and/or when then the cell is a 

mutator cell culturing the cell in conditions in which 

mutants are produced. 
20 15. Process for selecting a mutant of P450cam or 

P450BM.3r or a homologue thereof, for its ability to 

oxidise a particular substrate, which process comprises 

screening a group of said mutants for their oxidation 

effect on the particular substrate. 
25 16. Process according to claim 15 in which the 

mutant is additionally selected for its ability to 

oxidise the particular compound to a particular oxidation 

product . 

17. A process according to claim 15 or 16 in which 
30 the screening is carried out on the library made in a 

process according to claim 14. 

18. A process for producing a library of oxidation 
products comprising providing a substrate as defined in 
claim 1 to a library made in a process according to claim 

35 14 and allowing oxidation of the substrate. 
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19, An oxidation product obtained by a process 
according to any one of claims 1 to 5 and 13 wherein 
optionally the enzyme is one which has been selected in a 
process according to claim 15, 16 or 17 for use in a 

5 method of treatment of the human or animal body by 
therapy. 

20. A pharmaceutical composition comprising an 
oxidation product obtained by a process according to any 
one of claims 1 to 5 and 13 wherein optionally the enzyme 

10 is one which has been selected in a process according to 
claim 15, 16 or 17 and a pharmaceutically acceptable 
carrier or diluent. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ £10 


1. 


Claims: 1-14,17-20 all partially 






A mutant P-450 cam in which amino acid 96 has been changed 
to an amino acid with a less polar side-chain. Also a 
nucleotide encoding it. a process for oxidising a terpene or 
a cycloalkene with it, a cell expressing it. 


2. 


Claims: 1-14,17-20 all partially 






Same as invention 1, but pertaining to a mutant P-450 cam 
which amino acid 87 has been changed. 


in 


3. 


Claims: 1-14.17-20 all partially 






Same as invention 1, but pertaining to a mutant P-450 cam 
which amino acid 185 has been changed. 


in 


4. 


Claims: 1-14,17-20 all partially 






Same as invention l,.but pertaining to a mutant P-450 cam 
which amino acid 247 has been changed. 


in 


5. 


Claims: 1-14,17-20 all partially 






Sarpe as invention 1, but pertaining to a mutant P-450 cam 
which an amino acid in the acive site has been changed to 
amino acid with a less polar side-chain, as far as not 
covered by (l)-(4). 


in 
an 


6. 


Claims: 1-8.10,12-14,17-20 all partially 






Same as invention 1, but pertaining to a mutant P-450 BM3 
which amino acid 47 has been changed. 


in 


7. 


Claims: 1-8,10.12-14,17-20 all partially 






Same as invention 1. but pertaining to a mutant P-450 BM3 
which amino acid 51 has been changed. 


in 


8. 


Claims: 1-8,10,12-14.17-20 all partially 






Same as invention 1, but pertaining to a mutant P-450 BM3 
which amino acid in the acive site has been changed to an 
amino acid with a less polar side-chain, as far as not 
covered by (6) -(7). 


in 


9. 


Claims: 8-11,13,14,17-20 all partially 
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A cell which expresses P-450 cam, or a fragment thereof or a 
naturally occurring homologue or a mutant thereof, and an 
electron transfer reductase and an electron transfer redoxin 



10. Claims: 8,10-14.17-20 all partially 

A cell which expresses P-450 BM-3, or a fragment thereof or 
a naturally occurring homologue or a mutant thereof. 



11. Claims: 14,17-20 all partially 

A process for making a library of mutants of P-450 cam or 
P-450 BM3 by treating cells expressing P-450 cam or P-450 
BM3 with a mutagen or when the cells are mutator cells, 
culturing them in conditions in which mutants are produced 



12. Claims: 15,16 all completely, 17,19,20 all partially 

A process for selecting a mutant of P-450 cam or P-450 BM3 
for Its ability to oxidise a particular substance, 
comprising screening a group of mutants for their oxidation 
effect on the particular substance. 
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